1,938 research outputs found

    Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences

    Full text link
    Speaking rate refers to the average number of phonemes within some unit time, while the rhythmic patterns refer to duration distributions for realizations of different phonemes within different phonetic structures. Both are key components of prosody in speech, which is different for different speakers. Models like cycle-consistent adversarial network (Cycle-GAN) and variational auto-encoder (VAE) have been successfully applied to voice conversion tasks without parallel data. However, due to the neural network architectures and feature vectors chosen for these approaches, the length of the predicted utterance has to be fixed to that of the input utterance, which limits the flexibility in mimicking the speaking rates and rhythmic patterns for the target speaker. On the other hand, sequence-to-sequence learning model was used to remove the above length constraint, but parallel training data are needed. In this paper, we propose an approach utilizing sequence-to-sequence model trained with unsupervised Cycle-GAN to perform the transformation between the phoneme posteriorgram sequences for different speakers. In this way, the length constraint mentioned above is removed to offer rhythm-flexible voice conversion without requiring parallel data. Preliminary evaluation on two datasets showed very encouraging results.Comment: 8 pages, 6 figures, Submitted to SLT 201

    Minimally invasive strategy for gynecologic cancer with solitary periacetabular metastasis

    Get PDF
    SummaryTumor with bone metastases to the periacetabulum is rare, and its surgical management is challenging. Instead of wide excision with reconstruction of the hip joint, we used a relatively noninvasive method to manage periacetabular metastasis. Such a procedure for this condition has the benefits of short surgical time, less bleeding, and fewer complications during surgery. Our surgical management of the case reported here included curettage, phenol cauterization and filling of cisplatin-loaded cement in order to reduce local recurrence. After following-up for 2 years, there was no local recurrence and disease progression

    Prognostic diagnosis of the health status of an air-turbine dental handpiece rotor by using sound and vibration signals

    Get PDF
    This paper reports the diagnostic results of a free-running of air turbine dental handpiece (ATDH) with three rotor statuses by applying fast Fourier transform (FFT), Hilbert-Huang transform (HHT), and multiscale entropy (MSE) processes. The proposed method was tested under conditions of additional axial preload on the rotor and ceramic bearings with a damaged outer race supporting the rotor. A laser-Doppler vibrometer, condenser microphone, and portable MEMS system microphone were used to acquire the signals when the ATDH rotor features were changed. The results showed that changes in preload or malfunctioning ball bearings can be discriminated and abstracted using FFT and HHT to analyze the vibration frequencies. The experimental results showed that the proposed method can successfully predict the prognostic status of an ATDH rotor. The smart sensing of the health of the ATDH was achieved through a comparative evaluation of the MSE values. The proposed diagnostic method yielded satisfactory prognostic effectiveness in predicting the health status of the tested ATDH rotor

    An All Deep System for Badminton Game Analysis

    Full text link
    The CoachAI Badminton 2023 Track1 initiative aim to automatically detect events within badminton match videos. Detecting small objects, especially the shuttlecock, is of quite importance and demands high precision within the challenge. Such detection is crucial for tasks like hit count, hitting time, and hitting location. However, even after revising the well-regarded shuttlecock detecting model, TrackNet, our object detection models still fall short of the desired accuracy. To address this issue, we've implemented various deep learning methods to tackle the problems arising from noisy detectied data, leveraging diverse data types to improve precision. In this report, we detail the detection model modifications we've made and our approach to the 11 tasks. Notably, our system garnered a score of 0.78 out of 1.0 in the challenge.Comment: Golden Award for IJCAI CoachAI Challenge 2023: Team NTNUEE AIoTLa

    Reliability of flexible low temperature poly-silicon thin film transistor

    Get PDF
    This work reports the effect of mechanical stress-induced degradation in flexible low-temperature polycrystalline-silicon thin-film transistors. After 100,000 iterations of channel-width-direction mechanical compression at R=2mm, a significant shift of extracted threshold voltage and an abnormal hump at the subthreshold region were found. Simulation reveals that both the strongest mechanical stress and electrical field takes place at both sides of the channel edge, between the polycrystalline silicon and gate insulator. The gate insulator suffered from a serious mechanical stress and result in a defect generation in the gate insulator. The degradation of the threshold voltage shift and the abnormal hump can be ascribed to the electron trapping in these defects. In addition, this work introduced three methods to reduce the degradation cause by the mechanical stress, including the quality improvement of the gate insulator, organic trench structure and active layer with a wing structure. Please click Additional Files below to see the full abstract

    Structural and cognitive deficits in chronic carbon monoxide intoxication: a voxel-based morphometry study

    Get PDF
    BACKGROUND: Patients with carbon monoxide (CO) intoxication may develop ongoing neurological and psychiatric symptoms that ebb and flow, a condition often called delayed encephalopathy (DE). The association between morphologic changes in the brain and neuropsychological deficits in DE is poorly understood. METHODS: Magnetic resonance imaging and neuropsychological tests were conducted on 11 CO patients with DE, 11 patients without DE, and 15 age-, sex-, and education-matched healthy subjects. Differences in gray matter volume (GMV) between the subgroups were assessed and further correlated with diminished cognitive functioning. RESULTS: As a group, the patients had lower regional GMV compared to controls in the following regions: basal ganglia, left claustrum, right amygdala, left hippocampus, parietal lobes, and left frontal lobe. The reduced GMV in the bilateral basal ganglia, left post-central gyrus, and left hippocampus correlated with decreased perceptual organization and processing speed function. Those CO patients characterized by DE patients had a lower GMV in the left anterior cingulate and right amygdala, as well as lower levels of cognitive function, than the non-DE patients. CONCLUSIONS: Patients with CO intoxication in the chronic stage showed a worse cognitive and morphologic outcome, especially those with DE. This study provides additional evidence of gray matter structural abnormalities in the pathophysiology of DE in chronic CO intoxicated patients

    Abdominal Tuberculosis in Adult: 10-Year Experience in a Teaching Hospital in Central Taiwan

    Get PDF
    Background/PurposeTuberculosis (TB) is an important communicable disease worldwide. The clinical presentation of abdominal TB often mimics various gastrointestinal disorders and may delay accurate diagnosis. In this study, we conducted a 10-year retrospective study to investigate the clinical manifestations, treatment responses and outcomes of abdominal TB.MethodsThis retrospective study recruited patients presenting between January 1998 and December 2007; all patients ≥ 18 years of age with a diagnosis of abdominal TB were enrolled. Patient charts were thoroughly reviewed and clinical specimens were processed in the laboratory using the BBL MycoPrep System and BACTEC MGIT 960 Mycobacterial Detection System. Mycobacterium tuberculosis complex was confirmed by acid fast stain and the BD ProbeTec ET System.ResultsDuring the study period, 34 patients were diagnosed with abdominal TB. The mean age was 55+18 years. Fourteen patients (41%) had no risk factors; however, 20 patients (59%) had at least one risk factor. Abdominal pain (94.1%), abdominal fullness (91.2%), anorexia (88.2%) and ascites (76.5%) were the most common presenting symptoms. The peritoneum (88%) was the most commonly involved site. Patients with risk factors such as liver cirrhosis, end-stage renal disease and diabetes mellitus had a higher positive rate of acid-fast stain and mycobacterial culture from abdominal specimens (p = 0.02 and 0.05, respectively). The crude mortality rate was 9% and the attributed mortality rate was 3%.ConclusionIn an endemic area like Taiwan, regardless of whether a patient has risk factors for TB, abdominal TB should be seriously considered as a differential diagnosis when a patient presents with gastrointestinal symptoms and unexplained ascites
    • …
    corecore